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Abstract. We study word metrics on Z d by developing tools that are fine 
enough to measure dependence on the generating set. We obtain counting 
and distribution results for the words of length n. With this, we show that 
counting measure on spheres always converges to a limit measure on a limit 
shape (strongly, in an appropriate sense). The existence of a limit measure 
is quite strong — even virtually abelian groups need not satisfy these kinds of 
asymptotic formulas. Using the limit measure, we can reduce probabilistic 
questions about word metrics to problems in convex geometry of Euclidean 
space. As an application, we give asymptotics for the spherical growth function 
with respect to any generating set, as well as statistics for other "size-like" 
functions. 



1. Introduction 

Suppose one wants to study the density of group elements that have a certain 
property, or the average value of some statistic in a group. If the group is finitely 
generated by a set 5, then there is an associated word metric on the group that 
measures how far an element is from the identity, and the ball of radius n is a finite 
set. Arguably the most natural approach to a density question is to put counting 
measure on the ball of radius n, measure the proportion of those points with the 
desired property, and let n tend to infinity. 

Given a group G with a fixed finite generating set S (say symmetrized, so that 
S = S^ 1 ), let S n denote the sphere of radius n in the Cayley graph, which is just 
the set of group elements whose distance from the identity in the word metric is 
exactly n; that is, they are group elements whose minimal spelling has n letters, or 
which are reached by geodesies of length n based at the identity. Likewise, B n is 
the (closed) ball of radius n. Then a reasonable way to consider the density in G 
of a property with respect to a generating set is to find the expectation over large 
balls B n . Furthermore, one might be interested in understanding the frequency of 
a property among long words, which amounts to finding the expectation over large 
spheres S n — a strictly harder problem, as we will demonstrate. 

More generally, we will study the averages J2xeB n /( x ) anc ^ SxgS„ /( x )> 
not just for characteristic functions. For some functions we will find that these 
averages grow on the order of n k , in which case we normalize the average and seek 
a limit, or the coefficient of n k . 

We will show that averages for "size- like" functions over spheres in (Z d , S) must 
exist; further, the averages can be reduced to integrals on convex polyhedra in 
Euclidean space, with respect to an appropriate geometrically defined measure. 
One of the themes will be to study statistics that are a priori dependent on the 
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choice of generating set. In some cases, we will be able to quantify the extent of 
the dependence; in other cases, we will find that there is no dependence. 

Recall that a function g : R d — > R is called homogeneous (of order k) if g(ax) = 
a k g(x) for a > 0. Let us call a function / : 7L d — > R coarsely homogeneous if 
there is some homogeneous function g : R d — > R such that / x g, meaning that 
\g(x) — /(x)| is uniformly bounded over x G 7L d . We say that / is asymptotically 
homogeneous if there is some homogeneous function such that / ~ g, meaning that 
the ratio /(x)/g(x) — > 1 as x — > oo. (Here the notation x — > oo means that sequences 
leave all compact sets.) In particular, coarsely homogeneous implies asymptotically 
homogeneous when k > 1 and g is nonzero. 

For any free abelian group Z d with any finite generating set S, let Q be the 
convex hull of the points corresponding to S in R d , and let L be its boundary 
polyhedron, and let A denote the cone from A C L to the origin, so that Q = L. 
For A C L define the cone measure by 

... ... Vol(i) 

This is the Euclidean volume of the cone from A to the origin normalized by the 
volume of Q, so that /i = /i L is a Borel probability measure on L (and we will 
suppress L from the notation when it is understood). As we will discuss below, it 
is not hard to show that S n looks more and more like the the dilate nL, or in other 
terms, that —S„ — > L as a Gromov-Hausdorff limit. Our main contribution is to 
establish that counting measure on spheres limits to the cone measure on L in an 
appropriate sense to carry out averaging operations Q 







Figure 1. Six arcs are shown in red in this figure, each having 
cone measure 1/14; in other words, all of the colored regions have 
1/14 as much area as the convex body they are in. In the square 
and the hexagon, all sides have equal measure because in each ex- 
ample the triangles subtended by the sides are mutually congruent. 
On the other hand, for this octagon generated by the chess-knight 
moves {(±2, ±1), (±1, ±2)}, the measure of its two types of sides 
(shown with green and blue) is in the ratio 4 : 3. Cone measure is 
defined on any convex, centrally symmetric figure, and in particu- 
lar it is uniform on the circle. 



^We note that working in the asymptotic cone, R d , would be a substitute for the rescaling by 
dilations; furthermore, this suggests a natural generalization of these questions to other groups. 
To do this, one would need a theory of ultralimits of measures; in logic, this goes by the name of 
Loeb measure. These ideas have not yet been imported into geometric group theory, a translation 
that seems as though it would be quite fruitful. 
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Theorem 1.1 (Sphere averages). For any finite presentation (Z d , S) and any func- 
tion f : Z rf — > K asymptotic to g : M. d — > K wii/i 5 homogeneous of order k, let 
L = dCHull(S) and let \x L be cone measure on L. Then 

^-^/(x) = (^ 5 ).n fe + 0(n fc - 1 ), 
|An| xes„ 

wzt/i coefficient v g ^s '■= J L s(x) d/j, L (x). 

Notice that this limit measure /i is uniform on each face of L, in contrast to 
the hitting measure for random walks in the word metric, which has a Gaussian 
distribution. This is of interest because randomness problems in groups are often 
approached by studying asymptotics of random walks, rather than probabilities 
with respect to the word metric. This shows that they are in general quite different. 

To establish Theorem [HJ we prove a counting lemma relating the distribution 
of sphere points to the points of a lattice, and then use Ehrhart polynomials to 
relate lattice points to volume. As a consequence of this main theorem, we get the 
leading term for the average value over balls in the word metric. 

Remark 1.2 (Ball averages). With the same assumptions as above, let Q = 
CHull(S'). Then 

T^l E f(x) = (V g ,s)-n k + 0(n k - 1 ). 

with coefficient V g .s '■= /q g(x) dVol(x). 

Crucially, one may observe that this statement about ball averages can be ob- 
served much more easily than the sphere averages in the main theorem. (Just use 
the fact that ^B n becomes uniformly distributed in Q, and that the error term 
is counted by points in a region of lower-order volume.) Going further, one can 
deduce from ball asymptotics that if the counting measures on ^S n do converge 
to a measure on L, then it must be cone measure: that is the unique measure on 
L which, considering the necessary scaling properties, is compatible with Lebesgue 
measure on Q. However, there is no guarantee that the limit in Theorem 11.11 exists 
at all, even given the limit statement in Remark 11.21 In the next section, we will 
give further examples to illustrate that the sphere problem is strictly harder than 
the ball problem. 

As a consequence of Theorem 11.11 we can deduce a distribution result: the 
ball average for size-like functions compares to the sphere average by a simple 
multiplicative factor which is independent of the generating set. This factor depends 
only on the dimension d and the growth order k. 

Theorem 1.3 (Spheres versus balls). For any function f : V 1 — > R that is asymp- 
totically homogeneous of order k, 

lim - — rf(x) = I I lim —rf(x). 

That is, the coefficients of growth for sphere averages and ball averages are related 
by the simple expression V 9t s = f^+fcj v g,S- 

To put this result in context, we remark that there are three situations in which 
it is clear that the sphere-average should equal the ball-average. One case is that 



4 



DUCHIN LELIEVRE MOONEY 



of any function averaged over a group with exponential growth, where almost all 
of the points on the ball will be concentrated on its boundary sphere. Alternately, 
for any function averaged over Z d , the points in the ball again become increasingly 
concentrated in the boundary as d —¥ oo. Finally, sphere-averages clearly equal 
ball-averages for those size-like functions on Z d with k = (so / is close to a scale- 
invariant function). These last two cases provide a plausibility check on the 
factor in this theorem. 

Next, we can apply Theorem 1 1.1 1 to reduce problems about the asymptotic study 
of the geometry of Cayley graphs for Z d to problems in convex geometry. 

We find a supply of examples of asymptotically homogeneous functions from 
considering distance in the word metric. Using the standard embedding of Z d 
in K d as the integer lattice, the word metric on the Cayley graph for (Z d , S) is 
within bounded distance of a norm on R d , namely the norm induced by the convex, 
centrally symmetric polyhedron L. We will denote this Minkowski norm by ||x|| L 
and recall that it is defined as the unique norm for which L is the unit sphere. Then 
it is a basic fact (Lemma 13.51 below) that there is a uniform bound K such that 

Ml < Ixl < Ml + K 

for all x s Z d , where • | is the word length in the Cayley graph and K is the largest 
distance in the word metric from the identity to any lattice point in Q. Burago 
proved more generally that periodic metrics on R d are at bounded distance from 
norms in [3] ; we give a hands-on proof with the optimal constant for word metrics 
here. This ensures that distance in the Cayley graph for any finite generating set 
can be regarded as a coarsely homogeneous function. It is also immediate that 
f(x) — |x| p is asymptotically homogeneous of order p, and it follows from this that 
the pth moments of a word metric — expected position in a large ball, variance and 
standard deviation, skewness, and so on — are in a sense independent of the choice 
of finite generating set (discussed below). As another application of our counting 
results, we find asymptotics for the growth function /3(n) and spherical growth 
function o~(n) — the number of words of length up to n and exactly n, respectively — 
for Z d with an arbitrary generating set. 

Theorem 1.4 (Growth functions). Fixing (Z d ,S) as above, let f3(n) = and 
o~(n) = ftSn be the growth function and the spherical growth function, respectively. 
Then 

/3(n) = Vn d + Oi^- 1 ) ; a{n) = {d-V)n d - 1 + 0{n d ~ 2 ) ; 

where V = Vol(Q) and d is the dimension. 

There were previously known results in extremely special cases, as surveyed in [H 
VI. 2], but this settles the first-order growth asymptotics for 1 d completely. These 
formulas look deceptively simple, but some obvious possible generalizations are not 
true: in the next section, we show an example of a virtually abelian group which 
does not satisfy asymptotic growth formulas of this form. 

Acknowledgments. We thank Alex Eskin and Ralf Spatzier. The first author is 
partially supported by NSF grant DMS-0906086, and the third author is partially 
supported by NSF grant RTG-0602191. 
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2. The sphere problem 

It is a problem of extremely classical interest to count the lattice points in balls 
in metric spaces. In one well-known form, this is the Gauss circle problem: where 
R(n) is the number of standard lattice points in the round disk of radius n centered 
at the origin in the plane, Gauss first proved that 

R(n) = irn 2 + 0{n). 

Given strong enough estimates for functions on balls, one can derive estimates for 
the annular regions A„ = B n \B n _i defined as differences of successive balls. From 
Gauss's estimate in the circle problem we get R(n) — R(n — 1) = 2nn + 0(n), 
which is vacuous: the estimate tells us nothing about lattice points in the annulus. 
Historically, the next progress on the circle problem was by Sierpinski, who proved 
in 1906 that 

R(n) = im 2 + 0(n 2/3 ), 

from which we get R(ri) — R(n — 1) = 27rn + 0(n 2 / 3 ), which is a nontrivial estimate 
for annuli. Finding the optimal error term for the circle problem (conjecturally 
n 1 / 2 + e ) is a deep problem with ties to the Riemann hypothesis. Sphere asymptotics 
for word metrics can be regarded as a group-theoretic version of the Gauss circle 
problem, because for integer- valued metrics such as word metrics, the annular region 
B n \ B n _i is precisely the sphere S n . (And furthermore, the counting results rely 
crucially on enumerating integer points in geometric annuli A n L = nQ \(n— 1)Q.) 
This work studies counting and distribution problems in all finitely generated word 
metrics on free abelian groups. Though we note that our error term in this case is 
already of optimal order, so the problem is certainly not as rich! 

There are several issues that should be clarified at the outset. First, functions 
that can be averaged over balls do not necessarily admit well-defined averages over 
spheres and annuli. Furthermore, the spherical estimates that we obtain in this 
paper are strictly better than the difference of ball estimates, even though the error 
order is optimal in the ball estimates. In this section, we present simple illustrations 
of some of the subtleties. 

Here is a straightforward example in lattice-point counting to explicitly illustrate 
this issue. Take Q to be the unit square \] x [— |, |], L its boundary, and || • \\l 
the corresponding Minkowski norm on R 2 (the norm whose unit circle is L, which 
in this case is half of the sup norm). The ball of radius n with respect to || • \\l is 
the dilate nQ of Q by n. Since these balls are getting "fatter" as n gets large, it is 
straightforward to count the number of lattice points lying inside nQ to first order: 

#Z 2 n nQ = n 2 Arca(Q) + O(n) = n 2 + 0(n). 

However, it does not follow that the number of lattice points in the annuli equals 
2n + 0(1); instead, we get an oscillating sequence. Letting A n L = nQ \ (n — l)Q, 
we find that whenever n is odd, A n L contains no lattice points at all. But when 
n is even, it contains An lattice points. So the sequence #Z 2 n A„L is oscillating 
between An and 0. Correspondingly, there is no well-defined coefficient of n in the 
lattice-point count n 2 + 0(n) given above. Thus, the indicator function of Z 2 has a 
well-defined average over the ball nQ but not over the annular region which is the 
difference of two balls. The same phenomenon, that ball averages are well-defined 
but sphere averages are not, can also be observed in groups with complicated growth 
functions, as in Cannon's example discussed below. 
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In this paper, we will deal not only with geometric indicator functions, but 
also for averaging of more general functions, which can sometimes pose analogous 
difficulties. To take a number-theoretic example, one can study random properties 
of the integers by choosing uniformly over {1, 2, . . . , n} and letting n — > oo. This is 
what is meant by classical statements of analytic number theory (see [6]) such as 
The probability that two integers are relatively prime is 6/n 2 . 

We can express the probability of relative primality in terms of the Euler phi 
function (f>(n); here again, the order oscillates (that is, ^4>{n) has multiple accumu- 
lation points — one for the primes and another for the powers of two, for instance), 
while the average order ^-[^(l) + ■ • • + 4>(n)] converges. With respect to a Cayley 
graph for Z 2 , this means that sphere-averages do not exist, whereas ball-averages 
tend to 6 /it 2 . 

Let us draw the contrast between this situation and that of the size- like functions 
covered by Theorem ll.il In this example, the function / that is the indicator for 
the relative primality of the coordinates is coarsely homogeneous of order zero, but 
not asymptotically homogeneous, so Theorem 1 1 . 1 1 does not apply. Furthermore, for 
size-like functions, unlike for relative primality, the growth order (fc) of the sphere- 
average is the same as for the ball-average. But for size-like functions on groups, 
we will not only achieve overall counting results, but also distribution results which 
count points in every direction, as illustrated in Figure [2] 




Figure 2. For the generating set S = ±{6ex, ei, 6e2, e2}, spheres 
of radius n = 1, 6, and 20 are shown relative to the dilated limit 
shape nL. As n — > oo, the proportion of points in S n that lie in 
any fixed direction is converging. 

We will find below that 

#5„ ~ #(Z d n A n L) ~ Vol(A„£) = aln^-VoliQ). 

The first comparison here reduces the sphere counting problem to the lattice point 
counting problem (counting integer lattice points in a geometric annular region 
between balls in the norm || • and the second comparison is a solution to the 
lattice point counting problem. Counting problems like this can be studied in many 
other kinds of metric spaces. In [5], for instance, the lattice point counting problem 
is solved in the Heisenberg group, and partial results are offered on the sphere 
counting problem. 
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As a final note, our findings that #5„ = dVn^ 1 + 0(n d ~ 2 ) and #B n = Vn d + 
0(n d ~ 1 ) for any finite generating set give stronger statements than those surveyed 
in [3], where it is stated that \m\(#B n )/n d = V for the standard generating set 
and other special cases. Beyond this we are not aware of results in the literature 
prior to the present work concerning asymptotics of the spherical growth function. 
On the other hand, there was a surge of interest in the growth series of groups in 
the 1980s and 1990s. These are the generating functions for the growth function 
and spherical growth function, and they are known to be rational functions for 
several classes of groups. In [1 , Benson proves that the spherical growth series of 
any virtually abelian group is a rational function, and provides an algorithm for 
computing this precisely. But the fact of rationality does not give information about 
the asymptotics of the spherical growth function itself. As a powerful illustration, 
consider Cannon's example (see [4] Example VI. A. 9]): with appropriate generators, 
a certain virtually abelian group (the orientation-preserving part of the Euclidean 
reflection group in the equilateral triangle) has a spherical growth function which 
oscillates between two linear functions, and so never even becomes monotone! Thus 
growth asymptotics such as we compute here are not implied by rationality of the 
growth series. 



3. The limit metric 

When considering presentations (Z d ,S), we will assume throughout that S is 
symmetric, so that S = — S = Si is also the sphere of radius 1 in the group. 

First let us consider G = Z. With any finite generating set, the spheres of 
large radius are divided into a positive part and a negative part, each of uniformly 
bounded diameter in R. In particular, if a is the largest positive element in the 
generating set, then the most efficient spelling of a very large integer uses almost 
exclusively the letter a; in the language we will develop below, ±<z are the only 
significant generators. Let K be the smallest value such that the ball of radius 
K in the word metric contains all of the integers —a < m < a (so that K is a 
constant depending on S). The ball of radius K includes all positive integers up 
to a. The ball of radius K + 1 then includes all positive integers up to 2a, since 
integers between a and 2a can be obtained by adding the generator a. Continuing, 
we see that, for any n > K, the ball of radius n — 1 includes all positive integers 
up to (n — K)a. Thus the sphere of radius n is totally contained in the interval 
((n — K)a, na\ . That means that the positive real numbers ^S n are contained in 
the interval (a(l — K/n), a], and therefore jrS n — > {±a}. This depends on the 
choice of generating set, but in fact only on its convex hull in K. 

For G = 7L d the situation is similar. We will study word-length from a geometric 
point of view. Let S C Z d be a fixed finite set of generators. We adopt additive 
notation, so that every element of Z d has a representative in the form w = «iai + 

0:282 H V a r a r where S = {±ai, . . . , ±a r } and on E Z. Let |w| denote the length 

of w in the word metric, or the minimal ^ \cti \ over all representatives as above. A 
spelling is called a geodesic representative (or a geodesic spelling) if it realizes this 
minimum, since these spellings correspond to geodesic paths in the Cayley graph. 

Let Q be the convex hull in M. d of the generating set S, and let L denote its 
boundary. By construction, L is a centrally symmetric convex polyhedron. Let 
|| • || L denote the Minkowski norm on R d induced by L: this is the unique norm for 
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which L is the unit sphere. Namely, for x G M. d , \\x\\ L equals the unique A > such 
that x G XL. 

For any set M, let AM = {tx : t > 0, x G M} be the infinite cone on M from 
the origin with respect to dilation. Then let AkM = {tx : k — 1 < t < k,x E M} be 
the annular region from the (k — l)st to the fcth dilation, so that the cone M from 
M to the origin is equal to A.%M U {0}. For a a codimension-1 face of L, we will 
call Act the sector associated to ct. Vol denotes the Lebesgue measure on M. d (or 
Area if d = 2). The extreme points of Q are called significant generators. These 
are necessarily elements of the generating set, and it will turn out that many of the 
properties we study in this paper depend only on this subset of S. In particular, we 
will see shortly that the significant generators completely determine the averages of 
size-like functions in the word metric. The first basic observation is that L encodes 
the large-scale geometry of the group with this generating set. 

Lemma 3.1. An element a G S is on the polytope L if and only if na is geodesic 
for every n G Z. An element a G S is an extreme point of Q (or, equivalently, a 
vertex of L) if and only if na is uniquely geodesic for every n G Z. 

Proof. We give the proof for d = 2 for clarity. Suppose a G S is an interior point 
of Q, and suppose it lies in the sector determined by the extreme points b, c G S. 
Then there is a unique positive multiple aa that lies on the segment between b and 
c, so aa = /3b + 7c, with a>l,/3 + 7 = l, with necessarily rational coefficients 
since a, b,c have integer coordinates. But then by clearing common denominators, 
we have an integer multiple of a expressed in terms of b and c with a strictly smaller 
wordlength. This shows that na cannot be geodesic for all n. 

Now suppose na is not geodesic for some n > 1. Then we can write na = a^a^ 
with ^2 ct{ < n. Assume further that n is the smallest such value, so that a itself 
does not appear in this spelling. Then 

l|a|| L <£f IWL<i, 

showing that a is interior to Q. 

For the word na, any alternative spelling with the same length expresses a as a 
convex combination of other generators, and such an expression exists if and only 
if a is not extreme. 

(The proof is the same in arbitrary dimension: replace b, c with the extreme 
points bi, . . . , b„ in a cell of a triangulation.) □ 

Now we establish that the word metric limits to a norm, and that they differ by a 
bounded additive amount. As a consequence, the spheres in the word metric, once 
normalized, converge to a limit shape. This is a small special case of the theory 
for finitely-generated nilpotent groups and, more generally, lattices in Lie groups of 
polynomial growth (see Pansu and Breuillard [9j[2]). Here we give an elementary 
proof in terms of the combinatorial group theory and Euclidean geometry 

Lemma 3.2. For w G 1 d , \\\n\\ l < |w|. 

Proof. We prove that max w6 5 n ||w|| L < n for all n > 1 by induction on n. If n = 1, 
this is immediate from the definition of L. When n > 1, we can always write w G S n 
as w = w' + a where w' G 5„_i and a G S . But then Hw]^ < ||w'|| i + ||a|| L < n. □ 
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We say that v 6 1 d has a simple spelling in terms of the generating set if there 
is a geodesic spelling which uses only significant generators. We will denote by V n 
the set of points in S n which have a simple spelling. 

Definition 3.3. Fix a triangulation of L, and n > 1. For < k < n, let 




a,i non- negative integers, {v.;} bound a simplex, ^ a; = n 



be the set of non-negative integer combinations of the extreme points of simplices, 
with weights summing to n. 

Note that if p G V n , then it lies in nL n S n . This is because p belongs to the 
facet of nL that its extreme points do, so [|p||l = n. Its wordlength is at most n 
because we have a spelling of length n, so Lemma 13 . 2 1 ensures that the wordlength 
is equal to n. Note also that each p G Vn is at distance two in the word metric 
from the other elements that differ by — Vj, and therefore V n is 2-dense in nL 
with respect to the L metric as well. 



a + u a + v 




Figure 3. This figure shows that Q + S = 2Q. 

Lemma 3.4 (Tiles). For any whole number n>2, (n — 1)Q + S = nQ. 

Proof. We know that (n — 1)Q + Q = nQ, by convexity of Q. Now consider 
(n — 1)Q + S. This contains nS, the extreme points of nQ. Consider a, a simplex 
in the (fixed) triangulation of L. For a particular i, the set (n — l)a + v.; is a convex 
set (a copy of (n — l)cr). If the extreme points of a are {v^}, then those translates 
overlap, covering no. Thus (n— 1)L + S includes nL. (See Figure[3]) But since each 
cone a is contained in Q, one can similarly cone off to obtain the desired result. 
That is, 

(n-l)Q + S=(n-l)L + S= (\J(n - l)a) + S = \J ((n - l)a + S) = nQ. 

□ 
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Lemma 3.5 (Bounded difference). There is a constant K = K(S) > 1 such that 
for all w € 1 d , 

H\l<M < H\l + k. 

That contained in the annular region between (n — K)L and nL. 

Proof. Set K = max{|<3 n Z d |}. This is the largest wordlength required to fill in 
the convex hull of the generators; for example, in Figure SI we find K = 3. (Note 
that K can be arbitrarily large as the generating set S varies, but it only depends 
on S.) 

The ball of radius K then contains all of QDZ d , and thus by the previous lemma 
we have that the ball of radius K + 1 contains all of 2Q, and so on until the ball of 
radius n — 1 includes all lattice points in (n—K)Q. But since B n is contained in nQ, 
this precisely means that the sphere is contained in nQ \ (n — K)Q, as required. □ 




Figure 4. The chess-knight metric, with generators 
{(±2,±1), (±1,±2)}. The spheres of radius 1, 2, and 3 are 
shown entirely, as well as the first three dilates of Q. Four points 
from the sphere S4 are shown, marked with stars, to illustrate the 
difference between the norm and the word metric: those points 
have \w\ = 4 while ||iu||z, = 4/3. It takes three steps to fill in the 
lattice points in Q, so Lemma bounded-diff shows that \w\ and 
||w|| never differ by more than 3. 



Proposition 3.6 (Limit shape). As a Gromov-Hausdorff limit, we have 

lim —S n = L. 



Proof. For the forward inclusion, if x n e S n is a sequence, then by the previous 
lemma, 



lim 

n— >oo 



= .i»JNk 

L n-s-oo n 



= i im = 1. 

71— >00 fl 



The reverse inclusion follows from the fact that V n Q S n is 2-dense in nL. 



□ 



THE GEOMETRY OF SPHERES IN FREE ABELIAN GROUPS 



11 



4. The limit measure 

We will study the measure induced on L from the counting measure on S n . First 
we note that the rank-one case is trivial: considering (Z, S) with S = —S, there are 
exactly as many negative integers in S n as positive integers, by symmetry. Thus 
the counting measure on —S n limits to the uniform measure on the two-point set 
L = {±a}. We study d > 2 below. 

Note that -^S n n Act is near a by Lemma [3751 Recall that if fi n , /i denote Borel 
probability measures on a space X, then the following are equivalent ( 5, Thm 2.4]): 

• For every bounded continuous / : X — > R, 



lim / fdfi n = / fdfi 



• For every open U C X , 

liminf/x n ({7) > fi(U). 

n—too 

In this setup, (p n ) is said to converge weakly to \x. On the other hand, (/i n ) is sa id 
to converge strongly if 

lim sup Lu„(A) — fJ>(A)\ = 0. 

Theorem 4.1 (Strong convergence on L). Define measures fi n and fi on L by 

defining, for Lebesgue-measurable sets r C L, 

_ #(^„nf) _ #(5 n nA T ) _ Area(f) 

^ n[T) ~ #5„ ~ #5„ ' MTj -Area(Q)- 
T/ien /i n — > fi strongly. 

From this we will immediately derive the weak convergence of measures on R d 
needed to prove Theorem ll.il Let Af r (A) denote the r-neighborhood of A C M. d . 

Corollary 4.2 (Weak convergence on M. d ). Define measures v n and v on M. d by 

defining, for Lebesgue-measurable sets A C M. d , 

^) = #( "f; nA) ; v{A) = »{AnL). 
Then v n — > v weakly. 

Proof. Let U C M. d be open and set a = U P\ L. Given e > 0, let a' C ct a closed 
subset such that /x(ct') > /i(cr) — e. Then for any metric inducing the standard 
topology, we can take large enough n so that jVk7„(ct') C [/. Then 

liminf v n (U) > lim v n (&') = lim /U n (cr') = fi(<r') > /i(cr) — e — f(J7) — e. 

n— ^oo n— >oo n— >co 

Let e — > to get the desired inequality. □ 

Next we demonstrate that this sufhees to prove the main theorem. 

Proof of Theorem Suppose that / : Z d — > R is asymptotic to a function 
g : M. d — > R that is homogeneous of order fc, meaning that g(ax) — a k g(x), and 
lim — 1. This means that for all e > there exists TV such that 

x i B N (1 - e).g(x) < /(x) < (1 + e)g(x). 
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But then wo have 



n>N => (l-e)£ 9 (x)< £/(x)<(l + £ )£.g(x), 



which gives 



, _ , Exeg„g( x ) E x gs„ /( x ) < n , S x gs„g( x ) 

{ n><\S n \ ~ n k \S n \ - { ' + e) n*\S n \ ' 



Since e was arbitrary, this means that 



Exes„/( x ) ,. Exes„ff( x ) 
lim = lim . 

ra->oo n k \S n \ n->oo n fe |«SVil 



But then g(x)/n k = g(x/n), so 

n fc |S„| |S„| 



g(x) du n (x), 



with respect to the measure defined above. Noting that v n and v are supported on 
the compact set Q, weak convergence finishes the job: 

I™ — ^TuTT ' = lim / 5( x ) di/ "( x ) = / s( x ) dl/ ( x ) = / #( x ) rf Ai( x ) ; 



as desired. □ 

4.1. Rank two. We will prove Theorem l4.1l in this section (d = 2) and the following 
section (d > 2). We begin by giving the necessary counting argument in Z 2 , because 
it has some features that are particular to that dimension. 

In what follows, fix a side a of L and let u and v be the names of its endpoints, 
which are necessarily integer points. The key step in proving Theorem 14.11 is to 
count the number of points of S n in the sector Act over an entire edge. First we 
get control on the geodesic spellings of large words in the sector. 

Lemma 4.3 (Geodesic spellings). There is a uniform bound Dq such that for 
sufficiently large words w in Act, there is a geodesic representative of the form 
w = au + by + w', where |w'| < Dq. 

Proof. Label the elements of S which lie on the line segment between u and v as 
ai, . . . , a r . Then the first task is to show that we can find a geodesic representative 
of the form 

w = au + 6v + aj;a s ; + w", 

where |w"| is bounded. This is true because u, v, and the a^ are the only generators 
whose projection onto a 1 - (the line through the origin that is perpendicular to a) 
is one. We know by Lemma 13.51 that the projection of w onto a 1 - is within K of 
|w|, and thus there is a uniform bound on the number of other generators that can 
appear. 

Next, write each of the ai as + qi ~ Pi v, and let g = \cm{qi}. Then without 
loss of generality, the coefficients ai are at most q. (Otherwise, qai can be rewritten 
as an integer combination of u and v.) Finally, setting w' = E a i a i + w " completes 
the proof. □ 



THE GEOMETRY OF SPHERES IN FREE ABELIAN GROUPS 



13 



Corollary 4.4 (Modifying geodesic spellings). For all but boundedly many words 
w G A n <7 ; there is a geodesic in the Cayley graph from the identity to w which 
passes through the points 

w, w — v, w — 2v, ... w — Km. 

Proof. First, we show that all but boundedly many lattice points in A„ct have 
a geodesic spelling in which the coefficients of u and v are each at least K. We 
just consider the coefficient of v without loss of generality. Note that ||v|| L — 1 by 
definition of the L-norm. Thus words that are spelled w = au+6v+w' with |w'| < Do 
are within b + Dq of the line Au. Let D = K + Dq. Now note that A/r^A^u) has 
diameter 2D + 1, so its number of lattice points is uniformly bounded. This shows 
that w, w — v, . . . , w — Km are all metrically between w and e. 

Finally, if a point is farther than K from a line, then moving it by a distance K 
will not cross over the line. Thus the modified spellings still represent points in the 
sector Act. □ 

Then we get a very clean result: the integer points in A„ct count, up to bounded 
additive error, the quantity we seek. 

Lemma 4.5 (Sphere counting for Z 2 ). 

#(5„nAa)i#(Z 2 nA„a). 

Proof. Let <f>„ : Z 2 — > Z 2 be given by $ n (w) = w — mv, where m = |w| — n. That 
is, it modifies words by subtracting off copies of v when the wordlength differs from 

Now consider applying $„ to words w 6 A n cr. For such words, as long as n 
is sufficiently large and w is £>-far from Au, Corollary 14.41 guarantees that there 
is a geodesic representative using at least K copies of v. But we know that < 
w| — n < K, so this means that $„(w) = w — mv is a point on a geodesic path 
from e to w. Thus |$„(w)| = |w| — (|w| — n) = n, or in other words $„(w) G S n . 

This argument shows that, apart from a bounded number of points, gives 
a bijection from Z 2 n A„tr to S n Pi Act. Injectivity follows from Corollary 14.4) 
surjectivity is established by noting that if |x| = n and ||x|| L = n — k for a point in 
the sector, then x + fcv 6 A„o\ □ 

Proof of Theorem \4- 1\ when d = 2. The region A„ct is a quadrilateral with three of 
its four sides included, whose vertices have integer coordinates. Pick's Theorem 
says that for any polygonal region whose extreme points are integer points, the 
area is equal to the number of integer points in the interior plus half of the integer 
points on the boundary minus one (A = i + 1 — 1 ) . Now A n er contains one of the two 
long boundary segments, and the number of integer points on the short boundary 
segments is uniformly bounded. Therefore, up to additive error, the number of 
integer points in A n o~ is equal to its area. But its area is exactly 

Area(A n a) = n 2 Area(Aia) - (n - l) 2 Area(Aicr) = (2n - 1) Area(<r). 

Thus we have #(S'„ n Act) x (2n — 1) Area(ir), and summing over all sides gives 
#S n = (2n — 1) Area(Q), which shows that 

#(5„nAcr) Area(o-) 
Wn > Area(Q) ' 
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(Note that this also establishes the spherical growth asymptotics for d = 2, as in 
Theorem O) 

To complete the proof it suffices to show that the estimate #(Z 2 n A„r) x 
Area(A n r) is valid for small subarcs tCu. Consider A n r, and approximate it by 
an integer trapezoid T n in the following way: for the two vertices on nL, replace 
them with nearest-possible integer vertices on nL, and likewise for the two vertices 
on (n — 1)L. (T n is nondegenerate for sufficiently large n.) Then it is clear that 
both the area and the number of lattice points in T n are boundedly close to those 
values for A„t, so we are done. □ 

This completes the proof that counting measure limits to cone measure on the 
polygon L. 

4.2. General rank. For general rank d, we will get asymptotic comparisons rather 
than additive difference by carrying out the corresponding estimates. We obtain 
a limit shape L = lim n _ i . 00 —S n in R by taking the boundary polyhedron of the 
convex hull of the generators; we have a limiting distance on Mr via the norm 
induced by L; and finally, we obtain a measure on L as the limit of the counting 
measures on S n , which is proportional to the Euclidean volume subtended by a 
facet. However, we no longer have Pick's Theorem to count the points in regions 
of the facets, so we must replace that part of the argument. 

Recall that the generalization to higher dimensions of Pick's Theorem is by 
Ehrhart polynomials: the number of lattice points in the large dilates of a polytope 
with integer vertices is given by a polynomial formula in the dilation scalar. That 
is, there are coefficients {a{\ depending on P such that 

#(Z d n nP) = {\o\P)n d + a^n*- 1 + . . . + a 

for any natural number n. 

It follows immediately that the number of lattice points in A„cr is asymptotic 
to d ■ n^ 1 Vol(Aicr), by letting P = A^ and noting that A„cr = nP\ (n - 1)P. 

We would like to show that the limit measure on L is uniform on each face. First, 
by triangulating if necessary, we may assume that all the faces of L are simplices 
with integer vertices. 

Lemma 4.6 (Lattice counting for 1 d ). If a is a simplicial (d — \)-cell of L and r 
is a simplex of the same dimension contained in a, then 

#(Z d nA»r) = Vol Air 

n^o #(Z d PI A nC r) Vol AiCT ' 

Proof. First, observe that all the lattice points in A„r are contained in finitely 
many dilates of r. That is, 




Here, q is the number of hypcrplanes parallel to a between a and the origin which 
contain lattice points. To see that this number is finite, consider the formula for the 
distance from a point to a plane, remembering that a having integer vertices means 
that the hyperplane H containing a has an equation with integer coefficients. 

Let Aj be the subset Z d n kjH. Note that all of the Aj are translates of some 
common lattice A = 1 d n H' , where H' is the plane through the origin parallel to 
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H. Let V be the covolume of A and let c be the minimal diameter of a fundamental 
domain for A, which exists because the set of possible diameters is discrete. We 
have 

(*J ~ c)^^il < #(A . n k 3 r) < ( kj + c)^™^, 

and the same inequalities with the same constants holds for a. But n — 1 < kj < n, 
so enlarging c by one, we can write 

, w i Vol Air „.. , . xH iVolAir 
(„ _ c f-i_J_ < #(A . n %r) < (n + c)^ 1 — p^-. 

We can sum over j and get 

,j 1V0IA1T x ,j i Vol Air 

«(« - c ) — ^ < #( z n A « r ) < ?(« + c ) v ■ 

This means that 

#(^nA n r)~ g n-^. 

Of course the same holds for t = a. 
Thus 

#(Z d n A n r) = VolAir 
n^o #(Z d n A„cr) Vol Aid ' 
as required. □ 



The next difference is that Lemma 14751 no longer holds as stated, but is replaced 
by an asymptotic statement. 

Lemma 4.7 (Sphere counting for Z d ). 

#(S„ n Act) = #(Z d n A n a) + 0(n d ~ 2 ). 

To prove this, run the same bijective argument as before on points that are 
outside of a D-neighborhood of the cone on the boundary of a. The count of points 
close to the boundary is clearly lower-order, since they live in a region that measures 
length n in at most d — 2 vector directions, and is bounded in the others. 

This completes the proof of Theorem 14.11 for all d (since the d — 1 case is 
elementary). 

5. Applications 

5.1. Spheres versus balls. From the sphere averages, we can quickly deduce the 
other averaging statement previewed in the introduction. 

Theorem 11.31 (Spheres versus balls). For any function f : 7L d — > R that is asymp- 
totically homogeneous of order k, 

lim i r — rf(x) = [ ) lim —rflx). 

n^oo \B n \ ^ n k JK ' \d+kj n^oo \S n \ ^ n k J W 

That is, the coefficients of growth for sphere averages and ball averages are related 
by the simple expression V 9i s — f d+fcj v g,S- 
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Proof. We will repeatedly use the facts that / ~ g and g(ax) = a k g(x). Since \B n 
is uniformly filling in Q, the first equality is just a Riemann sum. 

Um TW1 E Hfe/M = / 3( x ) d Vol(x) = / «,(tx) [d-t d ~ l dt] dp(x) 
= d-(j^t d + h - x dt\ J g{x) d/i(x) 

□ 

5.2. Higher moments. Two very natural families of asymptotically homogeneous 
functions on Z d are collection of word norms and Minkowski norms. Applying 
Theorem ll.3l to these, we get information on the expected word- norm and expected 
location in R d for elements in the ball B n of radius n: 

Corollary 5.1 (Expectations). For any finite generating set for 7L d , the expected 
geodesic spelling length of words in the ball B n of radius n is gxj^i and the expected 
location in M™ is on the polygon j^-nL where L is the boundary of the convex hull 
of the chosen generating set. 

Thus, the expected position of a word in B n is on $2^/3, independent of the 
choice of generating set. 

To see this in an example, consider (Z 2 , std). As above, set up B n as the union 
of Sj for < j < n, noting that #Sj = Aj for j > 1 and ||x|| = j for x G Sj. Thus 
the average wordlength over the ball is 

4 E" f __ 4n3 + 6 " 2 + 2n 
1 + 4 Ei J ~ 6n 2 + 6n + 3 ' 

which grows like |n, as predicted. Though it was straightforward to calculate this 
directly for the simplest choice of generators, it is not apparent a priori how to 
proceed for an arbitrary generating set. 

More generally we can compute higher moments by applying Theorem ll.3l to the 
functions /(x) = |x| p which are asymptotically homogeneous of order p: 

Corollary 5.2 (Higher moments). The expected value of \x\ p over B„ is -j^- 

This tells us that the higher moments are independent of the choice of word 
metric as well. 

As we should intuitively expect, the sphere/ball theorem when k — (so / is close 
to a scale-invariant function) says that ball-average is equal to the sphere-average. 
This is also the case of any function averaged over a group with exponential growth, 
for a different reason: there, almost all of the points on the ball will be concentrated 
on its boundary sphere, so again the limiting ball-average is equal to the limiting 
sphere-average. 

5.3. Asymptotic density. The counting results can also be used to find the den- 
sity of group elements with a particular property, (P). For instance, we can say 
that a group element w G Z 2 has a simple spelling if w = a\ii + 6vi+i for consecutive 
significant generators. We can verify that every word has a simple spelling with 
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respect to the standard generators, whereas only 1 in 36 elements has a simple 
spelling in S — ±{6ei, ei, 6e2, 62} (compare Figure [21 where the words with simple 
spellings are those appearing on the bounding polygon). Let r be the number of 
sides of the polygon Q and let A — Area(Q). There are rn simple spellings of length 

n, all representing different group elements. On the other hand, #S n x 2An. Thus 
lim #&0J-U — ij m #Cj|ggCP)) _ _r_^ or j n |;h er words: 

Corollary 5.3. For (Z 2 , S), the density of words with simple spellings is r/2A. 

This does depend on the generating set — only on the convex hull, as usual, but 
not only on its area — and it holds uniformly at large word-lengths n, as well as 
when averaging over words of length < n. As a check, recall that Pick's theorem 
says that A = i + 6/2 — 1. We know that r < b and i > 1, which means r/2A < 1, 
which is required for plausibility. Besides recovering the answers above, we also 
see for instance that with respect to the chess- knight generators (see Figure 01 the 
probability of simple spellings is 2/7, which would have been extremely unpleasant 
to derive by hand. 

We can likewise define simple spellings in higher dimensions. Given (Z d , S), form 
L as usual, and let Si, . . . , be the facets of L, regarded as subsets of S via their 
extreme points. (For instance, a pentagon face of some Let 3 corresponds to a 
five-element subset of S.) Then a simple spelling is one of the form 

W = CtiZi 

for some j, with non- negative weights ot^. Using this and our techniques, one could 
obtain a number-theoretic expression for the proportion of simple spellings using 
the number of ordered partitions of large integers n into rij = #£j non-negative 
integers. 

5.4. Sprawl and statistical hyperbolicity. As is well known, the geometric con- 
dition called hyperbolicity (sometimes called word hyperbolicity, 5-hyperbolicity, 
or Gromov hyperbolicity) gives strong algebraic and geometric information about 
groups. Hyperbolicity is a large-scale invariant, so for finitely generated groups, be- 
ing hyperbolic does not depend on the choice of (finite) generating set. However, if 
we formulate a metric condition corresponding to hyperbolicity, then the measure- 
ments themselves depend on generators. We quantify the degree of hyperbolicity 
with a statistic we call the sprawl of a group (with respect to a generating set). We 
give a brief mention here, but develop some results and conjectures in [7]- 

The sprawl of a group measures the average distance between pairs of points 
on the spheres in the word metric, normalized by the radius, as the spheres get 
large. Sprawl thus gives a numerical measure of the asymptotic shape of spheres, 
which can be studied for arbitrary presentations of finitely generated groups. To 
be precise, let 

provided this limit exists. Note that since < d(x, y) < 2n, the value is always 
between and 2. By way of interpretation, note that E = 2 means that one can 
almost always pass through the origin when traveling between any two points on 
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the sphere without taking a significant detour. This statistic is not quasi-isometry 
invariant but nonetheless captures interesting features of the large-scale geometry. 

Hyperbolicity is often characterized with the slogan that "triangles are thin," 
meaning that the third side of a geodesic triangle must stay within bounded distance 
of the other two sides. In terms of x, y 6 S n , this says that the geodesic xy should 
be about as long as d(x, 0) + d(0, y), provided that 0a: and Oy do not fellow-travel. 
Thus, if fellow-traveling is relatively rare in a Gromov hyperbolic space, then we 
will have E = 2. We show in [7J that for any non-elementary hyperbolic group 
with any generating set, E(G, S) = 2. (Recall that a hyperbolic group is called 
elementary if it is finite or has a finite-index cyclic subgroup.) Thus, E < 2 is an 
obstruction to hyperbolicity. We will say that a presentation (G, S) is statistically 
hyperbolic if E(G, S) — 2; this does not imply that G is a hyperbolic group, but only 
that this metric calculation works out on average as though it were. (For example, 
F2 x Z is statistically hyperbolic with respect to its standard generators.) 

We can study statistical hyperbolicity for free abelian groups with the tools de- 
veloped in this paper. For a function of several variables / : (Z d )™ — > R asymptotic 
to g : (Mr) m — > R with g homogeneous of order k, Theorem 11.11 tells us that 



for all finite generating sets S. That means that we know exactly how sprawl 
depends on the generators. (Furthermore, there is an exact algorithm for computing 
this, presented in [7].) With the usual tools for coarse geometry, Z d would be 
indistinguishable from the Euclidean space K d , since they are quasi-isometric. But 
here we can compare the geometry of R 3 to Z 3 with the six standard generators 
(±ei, ±e 2 , ±e 3 ) to Z 3 with the eight generators (±ei ±e 2 ±e 3 ) to the free group F 3 , 
and see that they are arranged from least to most hyperbolic, having E values 4/3 < 
7/5 < 64/45 < 2. Thus sprawl is indeed a tool that detects geometric differences 
between presentations, and allows us to measure their degree of hyperbolicity. 

We end with a corollary which says that no free abelian group can ever be fully 
statistically hyperbolic. 

Corollary 5.4. For any free abelian group Z d with any finite generating set S, the 
sprawl statistic E(1 d ,S) is well-defined and E(1 d ,S) < 2. 

Proof. The sprawl is computed by integrating against a measure that is absolutely 
continuous with Lebesgue measure, and we are integrating a function whose maxi- 
mum value is 2. But a small neighborhood A of any point on the polyhedron has 
positive measure, so A x A has positive measure, and on that set the integrand is 
strictly less than two. □ 
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